High Quality Scalable Stereo Audio Coding
نویسندگان
چکیده
This paper proposes an efficient, low complexity, scalable audio coder based on a combination of two embedded coding algorithms: the SPIHT (set partitioning in hierarchical trees) coding algorithm [1] and an embedded, nested binary set partitioning (NBSP) algorithm. The SPIHT algorithm, considered to be the premier state-of-the-art algorithm in still image compression, is used for the low frequency subbands in a wavelet packet audio signal decomposition, while the NBSP algorithmencodes the high frequency audio subbands. Both left and right channels are encoded together to form a single embedded stereo audio bitstream, that can be truncated at any point to produce an optimal lower rate and quality bitstream for delivery to lower quality user services. Using standard MPEG test materials, we evaluate the performance of the proposed encoder compared to the MPEG II standard audio coder through informal listening tests at bit rates of 48Kbs/sec and 64Kbs/sec per channel. We conclude that our coder is comparable with MPEG II at 48Kbs/sec and better at 64 Kbs/sec per channel. The algorithm also features exact bit rate control, progressive transmission and low complexity for both the encoder and decoder. These features show its potential for interactive audio transmissionover networks.
منابع مشابه
Low Complexity Parametric Stereo Coding in Mpeg - 4
Parametric stereo coding in combination with a State-of-the-Art coder for the underlying monaural audio signal results in the most ef cient coding scheme for stereo signals at very low bit rates available today. This paper reviews those aspects of the parametric stereo paradigm that are important for audio coding applications. A complete parametric stereo coding system is presented, which was r...
متن کاملParametric Coding of Stereo Audio
Parametric-stereo coding is a technique to efficiently code a stereo audio signal as a monaural signal plus a small amount of parametric overhead to describe the stereo image. The stereo properties are analyzed, encoded, and reinstated in a decoder according to spatial psychoacoustical principles. The monaural signal can be encoded using any (conventional) audio coder. Experiments show that the...
متن کاملProgressive Syntax-Rich Coding of Multichannel Audio Sources
Being able to transmit the audio bitstream progressively is a highly desirable property for network transmission.MPEG-4 version 2 audio supports fine grain bit rate scalability in the generic audio coder (GAC). It has a bit-sliced arithmetic coding (BSAC) tool, which provides scalability in the step of 1 Kbps per audio channel. There are also several other scalable audio coding methods, which h...
متن کاملLow Complexity Decoding in Parametric Stereo Audio Coding Scheme
Parametric Stereo (PS) is an audio coding object of MPEG-4 HE-AAC v2 which utilized the Spatial Audio Coding (SAC) technique to enhance the compressing efficiency. However, the complexity at decoder is higher than that at encoder in PS. In this paper, we proposed a low complexity decoding scheme in PS. To take advantage of SAC, the encoder additionally extracts and transmits the parameters of r...
متن کاملHigh-quality and processor-efficient implementation of an MPEG-2 AAC encoder
Presented here is MPEG-2 AAC LC Profile encoder software for an Intel Pentium III processor. MDCT and quantization processing are accelerated by the use of SIMD instructions. Psycho-acoustic analysis in the MDCT domain makes the use of FFTs unnecessary. Better sound quality is provided by greater efficiency in quantization processing and Huffman coding. All of this results in high-quality and p...
متن کامل